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We consider the estimation of parametric fractional time series 
models in which not only is the memory parameter unknown, but one 
fT^ ' niay not know whether it lies in the stationary/invertible region or 

the nonstationary or noninvertible regions. In these circumstances, 
a proof of consistency (which is a prerequisite for proving asymp- 
totic normality) can be difficult owing to nonuniform convergence 
of the objective function over a large admissible parameter space. 
In particular, this is the case for the conditional sum of squares es- 



H 

,J^ \ timate, which can be expected to be asymptotically efficient under 



cd 



Gaussianity. Without the latter assumption, we establish consistency 
and asymptotic normality for this estimate in case of a quite general 
univariate model. For a multivariate model, we establish asymptotic 
normality of a one-step estimate based on an initial -yrl-consistent 
estimate. 
>' 

TT^ , 1. Introduction. Autoregressive moving average (ARMA) models have 

[~^ \ featured prominently in the analysis of time series. The versions initially 

CN ' stressed in the theoretical literature (e.g., [11, 26]) are stationary and in- 

Cf^ \ vertible. Following [6], unit root nonstationarity has frequently been incor- 

^^ ' porated, while "overdifferenced" noninvertible processes have also featured. 

^_J . Stationary ARMA processes automatically have short memory with "mem- 

ory parameter," denoted Jq; taking the value zero, implying a huge behav- 
.^_, ioral gap relative to unit root versions, where 5q = \. This has been bridged 

^ \ by "fractionally-differenced," or long memory, models, a leading class be- 

H ; ing the fractional autoregressive integrated ARMA (FARIMA). A FARIMA 
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{pi,So,p2) process xj is given by 

(1.1) xt = A-*{uil(t>0)}, t = 0,±l,..., 

(1.2) a{L)ut = ^{L)et, t = 0,±l,..., 

where {xt} is the observable series; L is the lag operator; A = 1 — L; 

{i-Lr^ = ta,icW, «.(C) = r(c)r("^^\) 

with r(C) = oo for ( = 0,-1, ... , and by convention r(0)/r(0) = 1; l(-) is 
the indicator function; a{L) and /3(L) are real polynomials of degrees pi 
and p2, which share no common zeros, and all of their zeros are outside the 
unit circle in the complex plane; and the et are serially uncorrelated and 
homoscedastic with zero mean. The reason (1.1) features the truncated pro- 
cess utl{t > 0) rather than simply ut is to simultaneously cover 6q falling 
in both the stationary region (Sq < ^) and the nonstationary region {So> ^, 
where otherwise the process would "blow up"). In the former case, the trun- 
cation implies that xt is only "asymptotically stationary." In recent years, 
fractional modeling has found many applications in the sciences and social 
sciences; for example, with respect to environmental and financial data. 

Early work on asymptotic statistical theory for fractional models assumed 
(5o < I [and replaced utt{t > 0) by ut in (1.1)]. Assuming Sq G (0, ^), [8-10] 
and [12] showed consistency and asymptotic normality of Whittle estimates 
(of (5o and other parameters, such as the coefficients of a and /3), thereby 
achieving analogous results to those of [11, 26] for stationary ARMA pro- 
cesses [i.e., (1.2) with ut = xt] and other short memory models. More re- 
cently, [16] considered empirical maximum likelihood inference covering this 
setting. Note that [8-10] and [12], and much other work, not only excluded 
(^0 ^ ^ but also the short-memory case 6o = 0, as well as negatively depen- 
dent processes where 6o < 0. To some degree, other 6o can be covered, for 
example, for 6q S (1,3/2) one can first-difference the data, apply the meth- 
ods and theory of [8-10] and [12] , and then add 1 to the memory parameter 
estimate, but this still requires prior knowledge that 5o lies in an interval of 
length no more than ^. 

On the other hand, [3] argued that the same desirable properties should 
hold without so restricting 5o, in case of a conditional-sum-of-squares esti- 
mate, and this would be consistent with the classical asymptotic properties 
established by [18] for score tests for a unit root and other hypotheses against 
fractional alternatives, by comparison with the nonstandard behavior of unit 
root tests against stationary autoregressive alternatives. However, the proof 
of asymptotic normality in [3] appears to assume that the estimate lies in 
a small neighborhood of 6o, without first proving consistency (see also [24]). 
Due to a lack of uniform convergence, consistency of this implicitly-defined 
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estimate is especially difficult to establish when the set of admissible values 
of 5 is large. In particular, this is the case when 60 is known only to lie 
in an interval of length greater than ^. In the present paper, we establish 
consistency and asymptotic normality when the interval is arbitrarily large, 
including (simultaneously) stationary, nonstationary, invertible and nonin- 
vertible values of 6q. Thus, prior knowledge of which of these phenomena 
obtains is unnecessary, and this seems especially practically desirable given, 
for example, that estimates near the Sq = ^ or 60 = 1 boundaries frequently 
occur in practice, while empirical interest in autoregressive models with two 
unit roots suggests allowance for values in the region oi Sq = 2 also, and 
(following [2]) antipersistence and the possibility of overdifferencing imply 
the possibility that 5o <0. 

We in fact consider a more general model than (1.1), (1.2), retaining (1.1) 
but generalizing (1.2) to 

(1.3) ut = 0{L;<^o)^t, t = 0,±l,..., 

where £t is a zero-mean unobservable white noise sequence, (^g is an un- 
known p X 1 vector, 6{s;(p) = Yl'jLo^jiv)'^'' ^ where for all (f, 0o{^) = 1, 
9{s;(p):C x W is continuous in s and |^(s;(^)| 7^0, |s| < 1. More detailed 
conditions will be imposed below. The role of 9 in (1.3), like a and f3 
in (1.2), is to permit parametric short memory autocorrelation. We allow 
for the simplest case FARIMA(0,(5o)0) by taking ^Jq to be empty. Another 
model covered by (1.3) is the exponential-spectrum one of [5] (which in con- 
junction with fractional differencing leads to a relatively neat covariance 
matrix formula [18]). Semiparametric models (where Uf has nonparamet- 
ric auto covariance structure; see, e.g., [19, 23]) afford still greater flexibility 
than (1.3), but also require larger samples in order for comparable precision 
to be achieved. In more moderate-sized samples, investment in a parametric 
model can prove worthwhile, even the simple FARIMA(1, 5q, 0) employed 
in the Monte Carlo simulations reported in the supplementary material [14], 
while model choice procedures can be employed to choose pi and p2 in the 
FARIMA(pi,(5o,P2); as illustrated in the empirical examples included in the 
supplementary material [14]. 

We wish to estimate tq = {60, (^q)' from observations Xt, t = l,...,n. For 
any admissible r = {6,(p')', define 

(1.4) et{T) = A^e-\L;cp)xt, t > 1, 

noting that (1.1) implies xt = 0, t <0. For a given user-chosen optimizing 
set T, define as an estimate of tq 

(1.5) r = argmini2„(r), 

tgT 

where 

1 " 

(1.6) i?„(r) = -5^e?(r). 
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and T = X X ^, where I = {5: Vi < 6 < V2} for given yi, V2 such that 
Vi<V25^isa compact subset of M^ and tq £ T. 

The estimate t is sometimes termed "conditional sum of squares" (though 
"truncated sum of squares" might be more suitable). It has the anticipated 
advantage of having the same limit distribution as the maximum likelihood 
estimate of tq under Gaussianity, in which case it is asymptotically efficient 
(though here we do not assume Gaussianity). It was employed by [6] in 
estimation of nonfractional ARMA models (when 5q is a given integer), by 
[15, 21] in stationary FARIMA models, where < 5o < 1/2, and by [3, 24] 
in nonstationary FARIMA models, allowing 5q > 1/2. 

The following section sets down detailed regularity conditions, a for- 
mal statement of asymptotic properties and the main proof details. Sec- 
tion 3 provides asymptotically normal estimates in a multivariate extension 
of (1.1), (1-3). Joint modeling of related processes is important both for rea- 
sons of parsimony and interpretation, and multivariate fractional processes 
are currently relatively untreated, even in the stationary case. Further pos- 
sible extensions are discussed in Section 4. Useful lemmas are stated in 
Section 5. Due to space restrictions, the proofs of these lemmas, along with 
an analysis of finite-sample performance of the procedure and an empirical 
application, are included in the supplementary material [14]. 

2. Consistency and asymptotic normality. 

2.1. Consistency ofr. Our first two assumptions will suffice for consis- 
tency. 

Al. (i) 

\e{s;^)\^\9{s;^o)\ 

for all (^ / yjQ, y' G ^, on a set S C {s : |s| = 1} of positive Lebesgue 
measure; 
(ii) for all tp, 9{e ,(p) is differentiable in A with derivative in Lip((;'), 
? > 1/2; 

(iii) for all A, 6{e'^^;ip) is continuous in ip; 
(iv) for all (/^ G ^, \e{s;(p)\ /0,|s| < 1. 

Condition (i) provides identification while (ii) and (iv) ensure that ut is 
an invertible short-memory process (with spectrum that is bounded and 
bounded away from zero at all frequencies). Further, by (ii) the derivative 
of 9{e^^;ip) has Fourier coefficients jOj{if) = 0{j~'') as j — )■ 00, for all ^p, 
from page 46 of [27], so that, by compactness of ^ and continuity of 6j{(p) 
in (fi for all j, 

(2.1) sup|%(v')|=0(j~(^+^)) asj^oo. 
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Also, writing 6~^{s;(fi) = (j){s;(p) = YlTLo^jiV')^'^ ^ '^^ have i?^o(<^) = 1 for 
all cp, and (ii), (iii) and (iv) imply that 

(2.2) sup|0jM| = O(r^i+^^) asj^oo. 

Finally, (ii) also implies that 

(2.3) inf |(^(s;<,o)|>0. 

|s|=l 

Assumption Al is easily satisfied by standard parameterizations of station- 
ary and invertible ARMA processes (1.2) in which autoregressive and moving 
average orders are not both over-specified. More generally, Al is similar to 
conditions employed in asymptotic theory for the estimate t and other forms 
of Whittle estimate that restrict to stationarity (see, e.g., [8-10, 12, 21]) and 
not only is it readily verifiable because is a known parametric function, 
but in practice 6 satisfying Al are invariably employed by practitioners. 

A2. The Et in (1.3) are stationary and ergodic with finite fourth moment, 
and 

(2.4) E{et\Tt^i) = 0, £;(e?|J-t-i) = ^o 

almost surely, where J't is the c- field of events generated by e^, s <t, 
and conditional (on J-f-i) third and fourth moments of et equal the 
corresponding unconditional moments. 

Assumption A2 avoids requiring independence or identity of distribution 
of Et, but rules out conditional heteroskedasticity. It has become fairly stan- 
dard in the time series asymptotics literature since [11]. 

Theorem 2.1. Let (1.1), (1.3) and Al, A2 hold. Then as n -> oo 

(2.5) T-^pTQ. 

Proof. We give the proof for the most general case where Vi < ^o — ^, 
but our proof trivially covers the Vi ^ (^o ~ 2 situation, for which some of 
the steps described below are superfluous. The proof begins standardly. For 
e > 0, define N^ = {t:\\t - to\\ < e}, Ne = {T:T ^ Ne,T G T}. For smaU 
enough e, 



(2.6) Pr(T G N,) < Pr inl 5„(t) < 

where S„(r) = Rnir) — Rn{To). The remainder of the proof reflects the 
fact that Rn^r), and thus (S'„(t), converges in probability to a well-behaved 
function when 6 > 60 — ^, and diverges when 6 < 6q — ^, while the need 
to establish uniform convergence, especially in a neighborhood oi 6 = Sq — 
2, requires additional special treatment. Consequently, for arbitrarily small 



6 J. HUALDE AND P. M. ROBINSON 

ri> 0, such that r] < 5o — ^ — X/i, we define the nonintersecting sets Xi = 
{6:Vl < S <So - I - V}, l2 = {S:5o - I - V < ^ < ^0 - ^}, l3 = {S:So - 
^<S<Sq — ^ + r/}, l4 = {6:6o — ^ + 1] < 5 < V2}- Correspondingly, define 
7i = Xj X ^, z = 1, . . . , 4, so T = Ui=i 'Ti- Thus, from (2.6) it remains to prove 

(2.7) Prf inf 5„(r)<o)^0 asn^oo, i = l,...,4. 

Each of the four proofs differs, and we describe them in reverse order. 

Proof of (2.7) for i = 4. By a famihar argument, the result follows if for 
T G 71 there is a deterministic function U{t) (not depending on n), such 
that 

Sn{T) = U{T)-Tn{T), 

where 

(2.8) _inf f/(T)>e, 

e throughout denoting a generic arbitrarily small positive constant, and 

(2.9) sup|r„(r)|=Op(l). 

Since xt = 0, t < 0, for r G 71 we set [cf. (1.4)], (t{T) = A^^^o^{L;(p)ut, 
U{t) = ECK-r) - <yl and r„(T) = i?„(To) - ag - {R^{t) - EQi-^)}- We may 
write 



^<^>-«(i/:fy--0' 



where 

5(A) = |1 - e*^|2(^-^»)|</.(e*^ ^)\\ g^{\) = g{X)\^^^^. 
For all T (27r)~^ J^^log((7(A)/go(A)) (iA = 0, so by Jensen's inequality 

(2.10) ^r<% 

Under Al(i), we have strict inequality in (2.10) for all r ^ tq, so that by 
continuity in r of the left-hand side of (2.10), (2.8) holds. Next, write 



t—l oo 

J' 



j=0 j=0 



where Cj{t) = '^l.^Q(pk{^)0'j~k{^o — ^)- Because, given A2, the ef — aQ are 
stationary martingale differences, 

1 " 
(2.11) Rn{To)-a'^ = -^ief-al)^pO asn^oo. 
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Then defining 7^ = E{utUt-k), and henceforth writing Cj = Cj{t), (2.9) would 
hold on showing that 

, ^2 ^i_i X 2n 

(2.12) sup ' ' 

% 



-E 



(2.13) 



(2.14) 



\j=o J \i=o / 

^ n t— 1 00 

t=l j=0 fc=t 
^ n 00 00 



Op(l). 



sup 

r4 



sup 

r4 



n ■ 



Op(l), 



Op(l). 



n 



t=\ j=t k=t 
We first deal with (2.12). The term whose modulus is taken is 

n-l n~j 

^Es^E(-?-7o) 

i=o 1=1 

n—2 n—1 n—j 

E E ^j^^ E {^i-uz-cfe-j) - 7j-fe} 

jr=0 k=j + l l = k-j + l 

(a) + (6). 



(2.15) 



2 

n 



First, 



^ n— 1 

^sup|(a)| <- Vsupc?^ 

T4 ^r^n Ti 



n-j 



E(-'-^o) 



(=1 



It can be readily shown that, uniformly in j, Var(^j^-^'^ uf) = 0{n), so 



Ta 



j=l 



by Lemma 1. Next, by summation by parts, (6) is equal to 

n—2 n—1 n—j 



2c„_i 



n 



E^i E E {um-^k-j) - ij-k} 

j=0 k=j+ll=k-j+l 
n—2 n-2 k n—j 



~~E^J E (^^-+1"^^) E E {"«^Z-(r-j) - 7i-r} 
j=0 k=j+l r=j+ll=r—j+l 

= {bi) + {b2). 
It can be easily shown that, uniformly in j, 

(n-l n-j \ 

E E ^'^/-(fc-j) =0(n2), 
k=j+ii=k-j+i J 
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SO we have 



n C / n-1 n-j \ ^ 1/2 

i=i I \fc=j+i/=fc-j+i / ) 

by Lemma 1, where K throughout denotes a generic finite but arbitrarily 
large positive constant. Similarly, 

Esnp\{b2)\ 

T4 

n n 

i=i fc=j+i 

r / fc n-i \ >| 1/2 

x<^Var J]; J]; u,n;_(,_j) \ 

I \r=i+l/=r-j+l / J 

by Lemma 1, where ? was introduced in Al(ii). It can be readily shown that 

(k n-j \ 

^ ^ uiUi_i^,_A<K{k-j){n-j). 
r=j+ll=T~j+l J 

Take r] such that ?] + | < 1 + ^. Then 



'^^ i=i fc=j+i 

n n 

3=1 k=l 

This is bounded by 

n n 

(2.16) ifn-i/2^i-3^-V2^A:''-i, 

jr = l fe=l 

because (A; + j)"''"^'^ < j~'^^kP~^l'^ . For small enough ry, (2.16) is bounded 
by Kn~'^^, to complete the proof of (2.12). Next, the term whose modulus 
is taken in (2.13) is 

^ n ^Tj- t—1 oo 

(2.17) -^/ /(A)^5]c,c,e^(^-^')^dA, 
'^1=1-'-^ i=Ok=t 
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where /(A) denotes the spectral density of ut- By boundedness of / (imphed 
by assumption Al) and the Cauchy inequahty, (2.17) is bounded by 






t-i 



^c,e 



UJL 



ijX 



dX 



j=0 
n ( t—1 oo 



5:c,e-^'^ 



k=t 



2 X 1/2 

dx\ 



1/2 



i=i Lj=o fc=t J 
so the left-hand side of (2.13) is bounded by 



n ( t 



1/2 



Kn-^ E 1 E •?■"'''"' E ^'^""'^ \ ^ ^""' E *"'' ^ ^^"^ = '^(i) 

t=l [j=l fc=t J t=l 

by Lemma 1, to establish (2.13). Finally, by a similar reasoning, the term 
whose modulus is taken in (2.14) is bounded by 

2 >j 1/2 n 



^•""'6{/ 



J^c.e 



ijA 



i=t 



i=l 



to conclude the proof of (2.14), and thence of (2.9). Thus, (2.7) is proved for 
z = 4. With respect to (2.7) for i = 1, 2, 3, note from % n A^^ = % for such i, 
and (2.11), that these results follow if 

(2.18) PrAnfi2„(T)<A') ^0 as n ^ oo,i = 1,2,3. 

Proof of (2.7) for z = 3. Denote, for any sequence C,f, W(^{X) = 

"■"^ X^jLi Ct X 6** ; -^c('^) ~ l'"^c('^)l^' ^^^ discrete Fourier transform and 
periodogram, respectively, and Xj = 2-kj /n. For Vn{T) satisfying Lemma 3, 
setting r* = (<5,(^'o)', 

n n 

^nir) = -E^^m(^^-) = -El^(^^'^'^)l'^e(x*)(A,) + -K(r), 



i=i 



i=i 



where C(s; v') = 6l(s; vJo)/6'(s;¥j) = X;°^o^j(<^)sJ. Then 



1 



(2.19) infi?„(T)> inf 1^(6*^ (^)r inf i?„(T*) - sup-|K(r) 

¥>G>I' 



Assumption Al implies [see (2.3)] 



inf Mie^';^)\'>e. 

Ag[-7r,7r] 
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Thus, 

(2.20) 
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1 " /*"^ V 

inf i?„(T) > einf — > > CLjSt^j 



sup-|K(t)| -sup-|T4^„((5)|, 



where Oj = Oj {5^ — 5) , and by Lemma 2 



t-i 



t=l t=l j=0 

By Lemma 2 and (0.6) in the proof of Lemma 3 in the supplementary ma- 
terial [14] (taking k = 1/2 there in both cases) 

(2.21) snp-\Wn{S)\ = oJn-' +^-^) =Op{l), 
and also by Lemma 3 (with k = 1/2 there) 

(2.22) sup-|K(r)|=oJi^^^') =Op(l). 
Next, note that for 5 G X3 

(2.23) -^ = -2{i;{j + 60-6)- ij{6o - 6))a] < 0, 

where we introduce the digamma function tl^i^x) = {d/dx)logT{x). From (2.23) 
and the fact that ip{x) is strictly increasing in x > 0, 



(2.24) 



inf n-^ ^ K^ ajEt-j > ^"' J^ J^ a' ( 2 " ^ ) ^t-j 
^ t=i \j=o / t=i j=o ^ ^ 

^ n t—1 



sup 

X3 



2 

— sup 
n X3 



i=i jVfc 

By a very similar analysis to that of (6) in (2.15), the second term on the 
right-hand side of (2.24) is bounded by 

n— 2 n— 1 n—j 

Yl Yl "j"^ Yl ^i^i~i.k~j) 

j=0 k=j+l l=k~j+l 

n—2 n—1 n—j 

Y^^ Y Y ^i^i-ik-j) 

j=0 k=j+ll=k-j+l 

n~2 n-2 k n—j 

/ .Qj Y (^k+i-ak) Y Y ^>-^l~{k-j) ) 
j=0 k=j+l r=j+l l=r—j+l 



2 
< — sup 

n X3 



2 

+ -sup 
n J3 
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which has expectation bounded by 

j=l j=l fc=l 



„.r-Y^r'"+^Y.r"''Y:('^+fr"v''- 



(2.25) 



\ j=i k=i / 



for any < a < 1/2. Therefore, there exists a large enough K such that 

n t-1 



(2.26) Prfsup 



n ^^^^ajakSt-jEt-k 
t=i j^k 



>K ^0 



as n^oo. Then, noting (2.20), (2.21), (2.22), (2.26), we deduce (2.18) for 
i = 3if 



\ t=l j=o ^ ^ 



ef_j < K — > as n — ^ oo. 



Now 

^i:E<.?(i-.)^?-,-s,.^+ii:E«5G-')(^?-,-.s) 

t=l j=0 ^ ^ \ I ' I) ^^l J^Q \ / 

-?EE4i-.. 

t=i j=t ^ ^ 

The third term on the right is clearly 0(n"^''), whereas, as in the treatment 
of (a) in (2.15), the second is Op{n~^l'^), so that (2.27) holds as r(27?)/r2(i + 
T]) can be made arbitrarily large for small enough r]. This proves (2.18), and 
thus (2.7), for i = 3. 

Proof of (2.7) for i = 2. Take rj < 1/4 and note that X2 C [Sq — K,6o — ^ + r]) 
for K = ri + ■^. It follows from Lemma 2 and (0.6) in the proof of Lemma 3 
(see supplementary material [14]) that 

supir„(5)|=oJlVt2,-l+ly^^,-l/2^,^ 

(2.28) 

= Op(n2'?-V2) = Op(l). 

It follows from Lemma 3 that 

(2.29) sup-|K(T)|=Op(n2^"^) = Op(l). 

T2 ^ 
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Denote /n,(<J) = ri"^ X;[Li(E5=o«i^t-i)^- ^y (2.28), (2.29), it follows 
that (2.18) for i = 2 holds if for arbitrarily large K 

(2.30) Pr(inf/„(,5)>i^)^l 
as n — )• oo. Clearly, 

2{5o~S) 1 v^/^ \^ 

(2.31) inf/„((5)>inf inf ,,, ,, > } a^et-i ■ 

t=\ \j=o / 

Defining bj^n[d) = aj{d)/n'^~^, bj^n = bj^ni^Q — ^), the right-hand side of (2.31) 
is bounded below by 



(2-32) inf \y^6?,„7^.f sup J, 


n.-2 n-1 


n-j 

bj,nbk,n y ^ £l^l~ik~j) 
l=k-j+l 


For 1 < i < n. 




mf 6j, n > mf -— — mf - 


-T{l/2 + ri)[n) 


fn 00\ ^iiT-.?! ^ "iiT-i "iml 




\ K n\-"^ 


J2 X2 r((5o-()) X2 V 


^ y/TT\n) 



Then by (2.33), using summation by parts as in the analysis of {h) in (2.15), 
the expectation of the second term in (2.32) is bounded by 

-1/2 ^ " 



K 



n 



E 



-3/2 



j=i ^ ' i=\ k=\ 

which, noting (2.25), is 0(1). Next, the first term in (2.32) is bounded below 

by 



n-l 



n-1 



n-3 



E('i--o) 



1=1 



(2-34) 5E(^-^>in(l/2 + r/)-^^feiJl/2) 

j=0 j=0 

Using (2.33) it can be easily shown that the second term in (2.34) is 
Op{n~^''^ X X]^=i t) ~ Op{n~^''^ log n), whereas the first term is bounded 
below by 

2r,> 



n r / ■\ 2n-l 



n 



E 



i=i 



n 



(2.35) 



2 Jl/n 2 

Op(n-2^). 



^2r? 3;2r?+l 1 1 



2r? 2r/ + l 



l/n 



4r/(2r/ + l) 
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Then (2.30) holds because the right-hand side of (2.35) can be made arbi- 
trarily large on setting rj arbitrarily close to zero. This proves (2.18), and 
thus (2.7), for i = 2. 

Proof of (2.7) for i = 1. Noting that i?„(r) > n"2(^^^^e,(r))2, 

(2.36) P^(}nfRn{r)> K)>Fr(^n'^inf(^-^^ > K^, 

because 6o — S > ^ + ri. Clearly X^J^^^ ^ti''') = X]?=o 'ij{'^)'^n-j-, where 
i j j-k j 

^i^'^) = ^'^k{T) = ^(l)k{^)^ai{5o - 5) = ^(t)k{'^)aj-k{^o -5 + 1). 

fc=0 A;=0 /=0 fc=0 

For arbitrarily small e > 0, the right-hand side of (2.36) is bounded from 
below by 

for n large enough, so it suffices to show (2.37) — )■ 1 as n — ^ oo. First 

1 " 

^ t=i 

where hn{5) ="-~^^^X^?=o ^j,n(,5o — 6 + l)en-j, ^j,n(') was defined below (2.31) 
and 

^ n — 1 oo 

rnir) = YJ^ ^ fej,n((5o -6+1) ^ <pk{^)Un-j 

j=0 k=j+l 

^ n~l 
(2-38) - -^ Y^ Sj^n{r)Un-j 



ni/2 



:i;¥': 



n-l 



for 



ni/2 . 

j=0 



Sj>('r) = ^(&fc+l,n('5o - (5 + 1) - &fc,n('5o -6 + 1))^ 4>j-l{^), 
fc=0 i=0 

where (2.38) is routinely derived, noting that by summation by parts 

dj{T) = aj{6o -5-1-1) 

j j-i k 

X Y^ (pk{(p) - ^(afc+i(<5o -6 + 1)- ak{6o -6 + l))Y'Pj~iiv>)- 

fc=0 fc=0 1=0 



14 
Now 



inf 

Ti 



J. HUALDE AND P. M. ROBINSON 



[-^is^fl'^i-)) >0\i;^o)%i<P\i;^)^nfhli6) 



-i^sup|(^(l;(^)|sup|/i„(5)|sup|r„(r)|. 
* Ji Ti 

Noting (2.3) and that under Al, sup^ |(/)(1;(^)| < oo, the required result 
foUows on showing that 



(2.39) 
(2.40) 

(2.41) 



sup|r.„(r)| = Op(l), 
Ti 

sup|/i„,((5)| = Op(l), 



,2, 



Pr(inf/i^(<^)>e)^l 



as n — )• oo. 

The proof of (2.40) is omitted as it is similar to and much easier than the 
proof of (2.39), which we now give. Let rn{T) = ^j^j^ rinir). By the Cauchy 
inequality 

/n-l / oo \ 2 n \ 1/2 

sup|ri„(T)|<— ^^sup6^^„((5o-'^ + l) sup ^ |(/'fc(¥')l X^^i ' 



Ti 



n 



Kj=0 ^1 



\ * k=j+i / i=i 

so that by (2.2), noting that £^(ELi^?)^^^ < Kn^/"^, 



Ssup|ri„(T)|<if(^sup(^) ( Y. ^"' 



2\ 1/2 



because <;" > 1/2 by Al(ii). Next, by summation by parts 



n / . N l+2»y 



\k=j+i / / 

1/2 

< Kn^/^~^ = oil) 



n-l 



n-2 



r2' 



k=l 



SO 



J=l 
SUP7-Js„_l,,i(T 



'^n—ki 



i=i 



n-l 



sup|r2„(T)| < 
Ti 



n 



1/2 



(2.42) 



^n„_ 



i=i 



n-2 



+ ^72 X] ^^P I^J+i."^'^) ~ '^J>('^) 



n^--T. 



y^^n-fc 



fc=l 



GAUSSIAN ESTIMATION OF FRACTIONAL TIME SERIES MODELS 15 
Given that afc+i(5o - S + 1) - ak{So - 6 + 1) = afc+i('^o - (5), 

fc=0 1=0 

so as ElY^"^^-^ Uj\ < Kv}''^ , noting (2.2) and Stirling's approximation, the 
expectation of the first term on the right-hand side of (2.42) is bounded by 

fc=i 2;i v / i^^ 

K " 



fc=i 



^Ivi^-V Y --V 



< 



k=l 

Next, noting that aj+i(5o — 5) — aj{5o — 6) = aj+i{6Q — 6 — 1), it can be shown 
that 



^ j i+i 



fc=l i=j-k+2 

(2.43) 

i+i 






1=1 



Thus, noting that, uniformly in j, re, -E'l^'^^^ u„_fc| < Kj^'"^, by previous 
arguments the contribution of the last term on the right-hand side of (2.43) 
to the expectation of the second term on the right-hand side of (2.42) is 
bounded by 

r^ n . . So-5 r^ n . . 1/2+-;, 



<Kn 



By identical arguments, the contribution of the first term on the right-hand 
side of (2.43) to the expectation of the last term on the right-hand side 
of (2.42) is bounded by 



1=1 k=l l=j-k -^1 \ / 
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(2.44) 

j=\ k=\ l=j—k 

Given that ELj-fc ^"^/^+'' < K{j - k)^^/^+''k, the right-hand side of (2.44) 
is bounded by 

-^f;//^g.-(,-.)-3/^-'' 

j=i k=i 

j=l fc=[j/2]+l 

where [•] denotes integer part. Clearly, the right-hand side of (2.45) is bounded 
by 



n^+'i ' ' '' 

3=1 \ fc=i 



- ^^-1/2 ,-3/2+,^-l-. +j-<^Y.k-^/^+V < i^(^-. + „1/ 



SO sup-j-^ |?'2n(''')| = Op(l) because <j > 1/2. Next, writing ut = 9{l;(pQ)et + 

et-i-et, for Si = Eilo^i(</^o)et-i> ^jiv>o) = Efctj+i 6'fc(¥'o)> where, by Al, 
A2, Et is well defined in the mean square sense, we have 

if^_ \ /n~l \ 

rsnir) = So-S+1/2 I "^aj^do - 6)£n-k - an~i{So -6 + l)£o |. 

\j=0 / 

In view of previous arguments, it is straightforward to show that 
sup7-Jr3„(r)| = Op(l), to conclude the proof of (2.39). 

Finally, we prove (2.41). Considering hn{6) as a process indexed by 6, we 
show first that 

where B{s) is a scalar Brownian motion with variance cJq and =^ means weak 
convergence in the space of continuous functions on Xi . We give this space 
the uniform topology. Convergence of the finite-dimensional distributions 
follows by Theorem 1 of [13], noting that A2 implies conditions A(i), A(ii) 
and A(iii) in [13] (in particular A2 implies that the fourth-order cumulant 
spectral density function of et is bounded). Next, by Theorem 12.3 of [4], 
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if for all fixed 6 €Zi hn{S) is a tight sequence, and if for all 61,62 G Ii and 
for K not depending on 6i,62,n 

(2.47) E{K{6i) - K{62)f < K{6i - 62?, 

then the process hn{6) is tight, and (2.46) would follow. First, for fixed 6, 
it is straightforward to show that sup,^£'(/i^(5)) < 00, so hn{6) is uniformly 
integrable and therefore tight. Next, 

E{K{6i) - K{62)f 



cr, 



2 "-1 



J^(&i,n(<^0 -61 + 1)- bj^n{6o -62 + l)y 



n 

j=0 

^ ^li6i - 62? y^ {a'j{6o-~6 + l) - aj{6o-~6 + l)\ognf 

by the mean value theorem, where 5 = 5„ is an intermediate point between 6\ 
and 62- As in Lemma D.l of [22], 

a'j{6Q -'6 + I) - Uj ((5o - 5 + 1) log n 

= WJ + 60-6+1)- i){6o - ^ + 1) - \ogn)aj{6o -5 + 1). 
Now (2.47) holds on showing that, for (^ gXi, 

(2,48) f^hzI±Jl'''^tlJS,-l+l)<K. 

j=0 



(2-49) ^ E(^(J' + So -6 + 1)- logn)hl^{6o -6 + l)<K. 

By Stirling's approximation, the left-hand side of (2.48) is bounded by 



K 



^^(5o-Vi + l)^ /,x2(5o-^) 



iZ^yi'-. 



.^'(5o - Vi + 1) >^ fj 



<K^^^^^^l±^±suJir\K. 



n 






Regarding (2.49), it can be shown that uniformly in Xi, ip{j + 60 — 6 + 1) = 
logj + 0{j~^) (see, e.g., [1], page 259). Thus, apart from a remainder term 
of smaller order, the left-hand side of (2.49) is bounded by 

(2.50) A--^ log^ bl^^So-~6 + l)<K-Y,(iogl) U 
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uniformly in Xi, the right-hand side of (2.50) being bounded by 
K L {logx)"^ dx = 2K, to conclude the proof of tightness. Then by the con- 
tinuous mapping theorem 

This is a.s. positive because the quantity whose infimum is taken is a Xi 
random variable times crg/[{2((5o — <5) + l}r(5o — S + 1)^], which is bounded 
away from zero on Xi . Thus as n — ?• oo 



Pr(inf/i^((^)>e 

■ 2^1 



Xi \Jo T{6o -6 + 1) 



>e 



and (2.41) follows as e is arbitrarily small. Then we conclude (2.18), and 
thus (2.7), for i = l. D 

2.2. Asymptotic normality of r. This requires an additional regularity 
condition. 
A3, (i) 

To G int T; 

(ii) for all A, 9{e^^]ip) is twice continuously differentiable in ip on 

a closed neighborhood Mei^o) ^^ radius < e < 1/2 about (Pq; 
(iii) the matrix 



/ 



vrVe 



■Eb;-(^o)/j 



. \ 



i=i 



v 



/ 



is nonsingular, where bj((^o) = Y^k=o^k{v>o)94>j~kiVo)/d^- 

By compactness oiMei^o) and continuity oid(l)j{(p)/d(pi, d'^(j)j{(p)/dipi dfi, 
for all j, with i,l = 1,. . . ,p, where fi is the ith element of (p, Al(ii), Al(iv) 
and A3(ii) imply that, as j — )• oo 



sup 

V5eA4(<^o) 



iiv>) 



d(pi 



0{j 



-(i+?) 



), 



sup 



d'Mf) 



difi difi 



0{j 



-(i+?) 



which again is satisfied in the ARMA case. As with Al, A3 is similar to con- 
ditions employed under stationarity, and can readily be checked in general. 



Theorem 2.2. Let (1.1), (1.3) and A1-A3 hold. Then as n^oo 
(2.51) n^/\9-ro)^dN{0,A-^). 
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Proof. The proof standardly involves use of the mean value theorem, 
approximation of a score function by a martingale so as to apply a martin- 
gale convergence theorem, and convergence in probability of a Hessian in 
a neighborhood of tq. From the mean value theorem, (2.51) follows if we 
prove that 



(2.52) ^^^^.iV(0,ao^A), 

(2.53) 2^79^^^"'''^' 

where ||t — to|| < \\t — tq\\. 

Proof of (2.52). It suffices to prove 

V t=2 j=l 

and 

^ n oo 

(2.55) -^Y.^tY.'^j('^o)et-j ^dN{0,alA), 

^" t=2 0=1 

where mj((po) = (~J~^)b'(¥'o))'- -^y Lemma 2, the left-hand side of (2.54) 
is the (p + 1) X 1 vector (ri + r2 + rs, (si + S2)')', where 



n 00 



^ t=2 j=t -^ 

^ n t—1 ^ 00 

^ t=2 j=l ■' k=t-j 

. n t-1 t-i-1 



i=2 i = i fc=o 

n 00 



n ^-^ ^-^ d(p 

t=2 j=t ^ 



t=2 j=t 
^ t=2 .7=1 ^ 



Clearly, £'(ri) = 0, and 

t=2 j=t s=2 k=s •' t=2 j=f' ^ ^ 
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noting that, by A2, the et and e^ — Uq are martingale difference sequences. 
Thus, ri = Op(n~^'^log^'^n). Next, E{r2) = 0, and Var(r2) equals 



^ n t—1 oo n s— 1 oo 

p-56) ^EEEEEE 



t=2 j=l fc=t-j s=2 1=1 m=s-l 



(f>k{'Po)(t>m{fo) 
jl 



E{et£sUt-j-kUs 



From (1.3) and A2, the expectation is aQ^j^k-i-m for s = t, and zero oth- 
erwise. By Al, ut has bounded spectral density. Thus, (2.56) is bounded 
by 



1 _ ^_ r'^ 



t—l oo 

EE 

j=i k=t-j 



Mv>o) ju+k)i, 



dfi 



t=2 j=l k=t~j 1=1 ■' 



n 

t=2 j=l k=t-j 1=1 



n 



t=2 1=1 



Now 



t-1 

E 

1=1 



it -I) 



-1-? 



[t/2] 

E 



I 



it -I) 



t-1 

E 



jl 



(t-j)-' 



-1-? 



E 

l=[t/2]+l 

K 



(t-l) 



-1-? 



<K{t~'^'logt + t-')< 



Then Var(r2) = 0{n-^ YJi=2 *"^ T!f=i3~^) = 0{n~^ log^ n), so 

f2 = Op{n~ ' logn). 
Next, by Lemma 2 



t=2 



Also, -E'(si) = and 



Var(si) = 



^ n oo oo 

^ Z—^ Z—^ Z—^ 

t=2 j=t k=t 






d(p dip' 
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4ii: 



j=t 



1 



I ji 2-^2-^ 



t=2 j=t 



i^ji'Po) 



dip 



1 



\ n^ 



2? 



O(n-i), 



t=2 



since ? > 21 II ' II denoting Euclidean norm. Finally, by Lemmas 2 and 4 

S2 = 0jn-V2^t-l/2-A =0^(^-1/2)^ 

to conclude the proof of (2.54). 

Next, (2.55) holds by the Cramer-Wold device and, for example, Theo- 
rem 1 of [7] on showing that 



(2.57) 
and 

(2.58) 






Ti 



i~\ 



a.s. 



Tt- 



-, n / 00 00 

t=2 \ j=l k=l 

^ n / 00 00 \ 

t=2 \ j=l k=l / 



because £'(ej Yy'jLi ^T=i ^ifo)™-'ki^o)^t-j£t-k\J^t~i) has expectation a^A, 
noting that the Lindeberg condition is satisfied as e^ ^°^]^mj((^Q)ej__j is 
stationary with finite variance. Now (2.57) follows as £t-j, j ^ Ij is J~t~i- 
measurable, whereas the left-hand side of (2.58) is 

2 n 00 00 

-^^^^^ji'Po)™-'kiVo){^t-jet-k - E{et-jet-k)) ^p 0, 

t=2 j=l k=l 

because Y.'jLiT,'kLi^j(.'Po)T^'k(.'Po)i^t-jet-k - E{et-jet_k)) is stationary 
ergodic with mean zero. This completes the proof of (2.55), and thus (2.52). 
Proof of (2.53). Denote by N^ an open neighborhood of radius e < 1/2 
about To, and 

^ n t-1 t-1 

(2.59) 
(2.60) 



1 



A"M = ;EEE 



n 

t=2 j=0 k=l 

00 00 / q2 



d^Ck dcj dc,_ 
•' dr dr' dr dr' ' "■'' 



A(r) = EE 



j=0 k=l 



d Ck dcj dck 



+ 



OtOt' dr dr' 



7 Hk-j. 
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Trivially, 
2 drdT' 



2 drdi 



-A„,(r) + A„(T)-A(r) + A(T)-A(To) + A(ro). 



Because Cj(to) = (/>_,• (tq), it follows that '^?LQCj{To)ut-j =£*, so the first 
term in A(to) is identically zero. Also, as in the proof of (2.55), the second 
term of A(to) is identically ctq A. Thus, given that by Slutzky's theorem and 
continuity of A(r) at tq, A(t) — A(ro) =Op(l), (2.53) holds on showing 

1 d'Rnir) 



(2.61) 
(2.62) 



sup 



A„,(t) 



Op(l), 



sup||A,,(t)- A(t)|| =Op(l) 



(2.63) sup 



for some e > 0, as n — ?• oo. As e < 1/2, the proof for (2.61) is almost identical 
to that for (2.12), noting the orders in Lemma 4. To prove (2.62), we show 
that 

n t-1 t-1 „2^ °o °o a2^ 

- 2^ 2^ 2^ ^i sTs^^'^"^' " 2^ 2^ '^^^ sTsT^^'^-^' 

i=2 j=0 fc=l j=0 fc=l 

is Op(l), the proof for the corresponding result concerning the difference be- 
tween the second terms in (2.59), (2.60) being almost identical. By Lemma 4, 
(2.63) is bounded by 

jy n t oo „ 71 oo 



n 



(2.64) 



i=l i=l fc=t+l 

71 oo oo 



n 



t=i j=t 



+ ^EE E /-^^^-^(fc-j)-^-^iog^fc, 

noting that (2.1) implies that 7^ = 0{j^^~''). The first term in (2.64) is 
bounded by 



n 00 



(2.65) -J^f 5^ A:^+-i(fc_t)-W<fi^i^^(A, + t)^+"-ifc-i-^ 

t=i fc=t+i t=i k=i 

for any a > 0. Choosing a such that 2e + a < 1, (2.65) is bounded by 



K 

n 



Y, *''+""' E '^"'"' = o(^'''"""') = 0(1)- 

i=l fc=l 

Similarly, the second term in (2.64) can be easily shown to be o(l), whereas 
the third term is bounded by 



(2.66) 



K 



n 



EE. 

t=i j=t 



2e+a-2 



E (k-^y 



■i-? 



k=j+i 
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for any a > 0, so choosing again a such that 2e + a < 1, (2.66) is 0(n^'^"^'^~^) = 
o(l), to conclude the proof of (2.53), and thus of the theorem. D 

3. Multivariate extension. When observations on several related time 
series are available joint modeling can achieve efficiency gains. We consider 
a vector x^ = {xu, ..., Xrt)' given by 

(3.1) xt = A^Hutl(t>0)}, t = 0,±l,..., 
where uj = (uu, . . . ,11^)', 

(3.2) Ut = &{L;ipo)et, t = 0,±l,..., 

in which et = {eu,. ■ ■ jErt)' , ^0 is (as in the univariate case) a px 1 vec- 
tor of short-memory parameters, &{s;(p) = J27Lo®jiv)s''^ ®o{v) = Ir for 
all (^, and Aq = diag(A o^ , . ■ . , A O''), where the memory parameters Sqi are 
unknown real numbers. In general, they can all be distinct but for the sake 
of parsimony we allow for the possibility that they are known to lie in a set 
of dimension q < r. For example, perhaps as a consequence of pre-testing, 
we might believe some or all the 5oi are equal, and imposing this restric- 
tion in the estimation could further improve efficiency. We introduce known 
functions 6i = 5i{S), i = l,. . . ,r, of q x 1 vector S, such that for some <5o we 
have Soi = 6i{So), z = 1, . . . , r. We denote r = {S', cp')' and define [cf. (1.4)] 

St{r) = &-\L-<f)A{S)xt, t>l, 

where A{S) = diag(A^ ,■■■■, A ''). Gaussian likelihood considerations suggest 
the multivariate analogue to (1.6) 

(3.3) K(T) = det{S„(T)}, 

where 5]„(t) = i^~^Ylt=i^ti''')^ti''')^ assuming that no prior restrictions 
link To with the covariance matrix of et- Unfortunately our consistency 
proof for the univariate case does not straightforwardly extend to an esti- 
mate minimizing (3.3) if g > 1. Also (3.3) is liable to pose a more severe 
computational challenge than (1.6) since p is liable to be larger in the mul- 
tivariate case and q may exceed 1; it may be difficult to locate an approxi- 
mate minimum of (3.3) as a preliminary to iteration. We avoid both these 
problems by taking a single Newton step from an initial -^/n-consistent esti- 
mate T. Defining 



1 " / 


'dst{T)y^^,^ ^det{T) 
, dr' ) ^- ^^) dr' 


H..w4ij( 


^'t^)^-n\r)e.irl 


we consider the estimate 




(3.4) T = T 


-H-i(r)h„(r). 
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We collect together all the requirements for asymptotic normality of r in: 

A4. (i) For all <^, 0(e ; (^) is differentiable in A with derivative in Lip(?), 
?>l/2; 

(ii) for all cp, det{e(s; cp)} / 0, \s\ < 1; 

(iii) the et in (3.2) are stationary and ergodic with finite fourth mo- 
ment, E{et\Tt-i) = 0, E{£t£^\Tt-i) = XIq almost surely, where Sq 
is positive definite, J-t is the cj-field of events generated by £s, 
s <t, and conditional (on J-f-i) third and fourth moments and 
cross-moments of elements of St equal the corresponding uncon- 
ditional moments; 

(iv) for all A, 0(e , t^) is twice continuously differentiable in cp on 

a closed neighborhood A4(v'o) °f radius < e < 1/2 about (^q! 
(v) the matrix B having (i,j)th element 

oo 

5^tr{(d«(v.o))'5]o"M?(¥'o)So} 
fc=i 

is nonsingular, where 

* 1=1 m=0 

1=1 '^* 
the $j(v') being coefficients in the expansion 0~ (s; cp) = $(s, ip) = 
^°f^Q$j((^)s-', where ^mi'Po) is an r x r matrix whose ith col- 
umn is the ith column of <I>j((^g) and whose other elements are all 
zero; 
(vi) 6i{S) is twice continuously differentiable in d, for z = 1, . . . ,r; 
(vii) r is a -^/n-consistent estimate of tq. 

The components of A4 are mostly natural extensions of ones in Al, A2 
and A3, are equally checkable, and require no additional discussion. The 
important exception is (vii). When @(s;(p) is a diagonal matrix [as in the 
simplest case @{s; cp) = 1^, when xu is a FARIMA(0, 6oi,0) for i = 1, . . . , r] 
then T can be obtained by first carrying out r univariate fits following the 
approach of Section 2, and then if necessary reducing the dimensionality in 
a common-sense way: for example, if some of the (5oi are a priori equal then 
the common memory parameter might be estimated by the arithmetic mean 
of estimates from the relevant univariate fits. Notice that in the diagonal-© 
case with no cross-equation parameter restrictions the efficiency improve- 
ment afforded by r is due solely to cross-correlation in St, that is, nondiag- 
onality of Sq- 



GAUSSIAN ESTIMATION OF FRACTIONAL TIME SERIES MODELS 25 

When &{s; (f) is not diagonal, it is less clear how to use the \/n-consistent 
outcome of Theorem 2.2 to form r. We can infer that uj has spectral den- 
sity matrix (27r)~^0(e* ; y3o)^o0(e~ , V'o)'- From the ith diagonal element 
of this (the power spectrum of un), we can deduce a form for the Wold 
representation of uu, corresponding to (1.3). However, starting from inno- 
vations £t in (3.2) satisfying (iii) of A4, it does not follow in general that 
the innovations in the Wold representation of uu will satisfy a condition 
analogous to (2.4) of A2, indeed it does not help if we simply strengthen A4 
such that the £t are independent and identically distributed. However, (2.4) 
certainly holds if et is Gaussian, which motivates our estimation approach 
from an efficiency perspective. Notice that if uj is a vector ARMA process 
with nondiagonal 0, in general all r univariate AR operators are identical, 
and of possibly high degree; the formation of r is liable to be affected by 
a lack of parsimony, or some ambiguity. 

An alternative approach could involve first estimating the Joi by some 
semiparametric approach, using these estimates to form differenced x^ and 
then estimating (^q from these proxies for Uj. This initial estimate will be 
less-than-y/n-consistent, but its rate can be calculated given a rate for the 
bandwidth used in the semiparametric estimation. One can then calculate 
the (finite) number of iterations of form (3.4) needed to produce an estimate 
satisfying (2.51), following Theorem 5 and the discussion on page 539 of [17]. 

Theorem 3.1. Let (3.1), (3.2) and A4 hold. Then as n^oo 
(3.5) n^/\9-To)^dN{0,B~^). 

Proof. Because r is explicitly defined in (3.4), we start, standardly, by 
approximating hn{T) by the mean value theorem. Then in view of A4(vii), 

(3.5) follows on showing 

(3.6) V^Kiro) ^dN {0,B), 

(3.7) H„(ro)^pB, 

(3.8) H„(T)-H„(ro)^pO 

for ||t — Toll < llr — ro||. We only show (3.6), as (3.7), (3.8) follow from 
similar arguments to those given in the proof of (2.53). Noting that 5ei(To)/ 
dr' = 0, whereas for t > 2, d£t{TQ)/dT' equals 

t-j-i t-j-i 

fc=l k=l 
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by similar arguments to those in the proof of Theorem 2.2, it can be shown 
that the left-hand side of (3.6) equals 



V" t=2 \i=l j=l / 



(1). 



Then by the Cramer- Wold device, (3.6) holds if for any (r +p) -dimensional 
vector 1? (with ith component ??,) 



n oo 



V t=2 j=l 

where Mj{(fQ) = Elti^fcdf ^q)- As in the proof of (2.55), (3.9) holds by 
Theorem 1 of [7], for example, noting that 



(oo oo \ 

j=i k=i I 

(oo oo \ 

j;j;tr{sU^.M;.(<^o)5^o"'Mfc(<^o)^t-fc} 
i=i k=\ J 

CO 

= ^tr{M;.(<^o)5:o"'M,((^o)5:o} = ^'B^ 

to conclude the proof. D 

4. Further comments and extensions. (1) Our univariate and multivari- 
ate structures cover a wide range of parametric models for stationary and 
nonstationary time series, with memory parameters allowed to lie in a set 
that can be arbitrarily large. Unit root series are a special case, but unlike in 
the bulk of the large unit root literature, we do not have to assume knowl- 
edge that memory parameters are 1. Indeed, in Monte Carlo [14] our method 
out-performs one which correctly assumes the unit interval in which 5q lies, 
while in empirical examples our findings conflict with previous, unit root, 
ones. 

(2) As the nondiagonal structure of A and B suggests, there is efficiency 
loss in estimating lPq if memory parameters are unknown, but on the other 
hand if these are misspecified, c^q will in general be inconsistently estimated. 
Our limit distribution theory can be used to test hypotheses on the memory 
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and other parameters, after straightforwardly forming consistent estimates 
of A or B. 

(3) Our multivariate system (3.1), (3.2) does not cover fractionally coin- 
tegrated systems because Sq is required to be positive definite. On the other 
hand, our theory for univariate estimation should cover estimation of indi- 
vidual memory parameters, so long as Assumption A2, in particular, can be 
reconciled with the full system specification. Moreover, again on an individ- 
ual basis, it should be possible to derive analogous properties of estimates 
of memory parameters of cointegrating errors based on residuals that use 
simple estimates of cointegrating vectors, such as least squares. 

(4) In a more standard regression setting, for example, with deterministic 
regressors such as polynomial functions of time, it should be possible to 
extend our theory for univariate and multivariate models to residual-based 
estimates of memory parameters of errors. 

(5) Adaptive estimates, which have greater efficiency at distributions of 
unknown, non-Gaussian form, can be obtained by taking one Newton step 
from our estimates (as in [20]). 

(6) Our methods of proof should be extendable to cover seasonally and 
cyclically fractionally differenced processes. 

(7) Nonstationary fractional series can be defined in many ways. Our defi- 
nition [(1.1) and (3.1)] is a leading one in the literature, and has been termed 
"Type II." Another popular one ( "Type I" ) was used by [25] for an alternate 
type of estimate. That estimate assumes invertibility and is generally less 
efficient than t due to the tapering required to handle nonstationarity. It 
seems likely that the asymptotic theory derived in this paper for r can also 
be established in a "Type I" setting. 

5. Technical lemmas. The proofs of the following lemmas appear in [14]. 

Lemma 1. Under Al 

t-i 
(5.1) et{T) = ^Cj{T)ut-j 

j=0 

with co(t) = 1 where for any 5 £Z, as j —)■ oo, 

sup \cj{t)\ = 0(jmax(5o-5-l,-l-?)^^ 

(5.2) 

SUp|Cj+i(t) - Cj{t)\ = C)(jmax(5o-5-2,-l-?))^ 

Lemma 2. Under Al, A2 

t-i 
^t{T*) = ^ ajEt-j + vt{6), 

j=0 
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where r* = {6, (p^) and for any k, > 1/2 



sup \vt{5)\=Op{t^~^) 

(5o-K<<5<<5o-l/2+r; 



Lemma 3. Under Al, A2 



(5.3) E^^m(^.) = E 



0{e'^^;fo) 



e{e^^j;(p) 
where for any real number k > 1/2 



4(t*)(Aj) + K(t), 



(5.4) sup \Vn{T)\ = Op{log'nl{K = l/2) + n'''-'l{K>l/2)). 

5o-K<S<So-l/2+r] 

Lemma 4. Under A3, given an open neighborhood N^ of radius e < 1/2 
about To, as j ^ CO, 

sup|c,(t)|=0(/-^), 



sup 



5cj(t) 



9(5 



0(/-Mogi) 



sup |c,+i(r) - c,(r)| = 0(j— (-2,-1-.))^ 



sup 



•GAT, 

9(5 



(cj+i(t)-Cj(t)) 



sup 

tGA, 



sup 

xGA, 



d'c^i-r) 



86^ 
dcjir) 



dif 



sup 

TGA, 



sup 

tGAe 



^2 

^(cj+i(r)-Cj(r)) 

d 

— {cj+i{t)-cj{t)) 



d'c,{T) 



sup 

XGA, 



sup 

xGA, 



dip dif' 



dip 85 



o{r^~' + r^ log j), 
o(/-Mog2i), 

o(/-i), 

0(i-^-^+/-'log2i) 

Q/ •max(€-2,-l-?)\ 

o(/-'), 

0(/-Mogi), 
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sup 



sup 



dip dip 
52 



dip 85 



-{cj+i{t) - Cj{t)) 
(cj+i(T)-Cj(r)) 



Q/ •max(e-2,-l-<f)\ 
0{r'-' +f -Hog j). 
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SUPPLEMENTARY MATERIAL 

Supplement to "Gaussian pseudo-maximum likelihood estimation of frac- 
tional time series models" (DOI: 10.1214/11-AOS931SUPP; .pdf). The sup- 
plementary material contains a Monte Carlo experiment of finite sample 
performance of the proposed procedure, an empirical application to U.S. in- 
come and consumption data, and the proofs of the lemmas given in Section 5 
of the present paper. 
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